PyDigger - unearthing stuff about Python


NameVersionSummarydate
gtrbench 0.0.1 A benchmark to evaluate implicit reasoning in LLMs using guess-the-rule games 2025-01-19 01:58:11
Ali Shazal (with Michael Lu, Xiang Zheng, Juno Lee, Arihant Choudhary)
hourdayweektotal
7514887252285094
Elapsed time: 1.29006s